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The coiflete texts of 251^ trade books toT thildceh in 
the priBary gtade^ vere analyzed by cospoter. in order tOl^iiAt if y^ 
recarridg.tvo- and tbreervord strings, of Ahe 202,76'3'^*#p^sasple^ 
^at ^resulted, 89 tvo^vord striQgs occurred 100 tises «£^,Bbre,#and 
only two three-vord strings occurred sore than* 100 ti'ses^ These 
frequencies represent, respectively, V0.5X' and O.ISX of the tptAl 
nasber of itrings in the sasple. The investigatcrs propose that these 
Vlfesnlts are directly , applicable to classroos reading .intft^jidtioay and 
suggest that beginning readers can be taught su<fh vord strings in 
such the sase aanner as coaaon sight* words are presebtly taught in 
initial reading instruction. * (Bt) . ^ 
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^ Bepr6ductions\ supplied by EPB^ are the best that can be aade * 
^ ^ froa the original docusent. ^ 
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The concept of phrases or ^nanon wrd 
referr^ed to in this pap^r, is not ftew. As early/4s 1908., Huey 
noted that, 'Unitary recognition of ph/ases is/very cotmnon in 
reading, for mentally the words do i/t stan^ apart*' (p. 115).^ 

In a related statement, he claimed/that, '^he reader's acquire- 

/ 

aent of ease and power in reading coaes/through increasing 
ability to r^ad in larger uni^4"' (p. /le*) . The need to teach 
readers to attend to phrasea has re/entl^ been cited by Harris 

• * / / 

and Sipay (1971, 1975), de^hant (;970), Pftllinann, Rouch, Char 

# 

.and Defeoer (1978) . 

The purpose of thl4 investigation was |o identify capon t^wp- 

word tnd three-word s/rings in the' text of children's tradevboolts . 
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p ^ ' 

Procedures • , 

Ad stated in t-he paper- by Moe and Hopkins (1978), the inves- 
tigators betieved that- in order to sut^feantiate clalBS that, they ^ 
had, identified certain phrases which were used with any degree 
of .regularity in printed materials, a minimum of 50,000 running 
worts needed to be analyzed. Samples of hundreds of tltoosands of 
words were thougfrt- to be the rfost desfrabl^. To obtain a 'language 
torpua of 'acce^table size, the Lvespigators selected 250 widery 
read, highly VeConmended picture books^or 
tetween the years 1928 and 1977. ' ^ , • 

The complete text of each of the trade books frequently read 
by prliary-grade children was (keypunched, aresulting in a sample of 



202,763 words. The text *samt)l^ was- then analyzed via the Ho^ and 
• Hopkins (197^) computer progfam designed to identify recurring 
• \ word' Strings. The progranr is designed to identify all consecu7 
tive two-vord and three-wori strings occurring in the corpus. 
Ipfo nation regarding the""total humber of two-word and three-word 
strings- -encountered and the number of d"l^ferent strings represented 
^ is. provided on the printout/ The frequency with which the strings 
occur is tabulated beside the listing of each individual word 



group 



• ♦ 

Findings . 

Within the total corpus there were- 170,249 two-word strings. 
Of this- nitabe'r, 85,511 different; strings were ident-ified. However, 
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only 69 of these different two-word strings «fcurred 100 tiaes or 
•ore in the corpus. The Strings their frequency pf occurrence, 
'and thf percentage of the total corpus they represent are presented 
in. Table^l. . ' f ' ^ ' . • * 



Insert Table; T aboutT here 

— — — — — — • — — — — \ — -r — — — — — — — — — — — 

m 9 

There were -141,018 three-word strings encountered in the 



corpus, V 120,024 of which were* different, meting thap only** 15Z 
of thea. occurred sore' than one tiib^ in the text; *Only two of 
the threeWord. strings occurred moM than 100 tises. Table 2 
contains a listing of the ten *aDst ccwon three-word strings^ 
and their frequency of occurrencfe. 

• • ^ ✓ * » . 

Ihsert Ia))le 2 about here'. # 



% If one considers the cumulative frequency o'f .ail tffo-word . 
strings occurring more' than 100 tirca, it can be seen that these 
89 strings account for 10.52^ all strings pcourring'ln text. 
For three-word strings, the two strings that occ-ur more than 100 
tiaes account for -152 of the tptal niaber of strings in the corpus. 

Discussion ' ' V • 

There are several pbdervat.iona to be ^de about *the word 

• • • * 

St rings ^ appear ing^ in Tables 1 ^nd 2. First, while the ciaulative 
frequency of the two-word phrases is ndt m^eedingly high, it is of 
inteVesl;. to loo\ at the individual worda whsk sake up the t%#o-w>rd 



r 

ptTrases occurring 100 tlaes or more. It' is possible that there 
could have been 178* different words represented in these 89 phrases. 
Hovever, because of ^ the r^pe4te(f use of certain words, only 68. 
xdlffereftt words were used and the aiajority of these words (75Z) 
are within the 100 »ost frequently occur rii(g\ord8 on the Carroll, 
DavieS, Ricl^an (1971) ^pfrd list. Table 3 contains a listing of 



these wrds and th^ir frequency of occurrence. 



4 



Insets Table 3. about here 



A second observation to be made is that these zomon word 
strings are derived from the text of trade books coaaotfly used 
with prlJBAry grajie childten, books which typically are %nfitten 
wit'bcut the vocabulary control found in textbooks, particularly 
basal readers, written* for primary-grade students. tThe inves- 
tigators suspect that had the corpus been based on text materi- ; 
als used in. the primary grades, the comon swings would have 
occurred with greater frequency. 

The third observation deals with a cdmparison of the connon 
strings identified in the present study with the cosaon strings 
or phrases opcurring cm Dolch Sight Phrase Cards (1948). 
Dolch's phrase cards represent combinations of the 95 cononest 
^ nouns and the 220 words on the Dolch basic sight vocabulary list. 
Of the 144 phrase pards, none of the f 6 ^hree-=wor^ phrases appeared 
id.thin' the ten most coaabn three-word strings in the present study. 
It should be 'pointed out, howevfer, that if one examines the first 
two words in.Dolch's three-vord^hrases, 35, of these also appear 
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/ in the two-vord stdtngs occurring in the present investigation / ^ 
S\X of the*6& tvo'^rd phrases included in tl>e Dolch cards appear 
in the list of twfevord strings ociiurririg-more than 100 tines in 
• the present inv^igationl 



The inVest^lgators Relieve, that the results of *this study are 
directl:^ appli^lfble to classrooa reading instruction. * Since a 
•nvaiberof aywMn word strings can be identified in text, and vere 
identified i*: this 'study , beginning 'reader^ can be taught such * 
word styln^in auch the sa»e- Banner as coifcon sight vords are. 
presently Aught' in initial reading^lAstniction. 
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TABLE 1, 



MOST COMMON TWO-WORD SJRINGS 
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String 



Frequency 



J of. Total 

Strings 
Encountered 



' CuBulative 
Frequency 



in th4 


1153 


• oo 


of the 


862 


f, .51 


to the^ 


' 714 ' 


.42 


and the 


653 


. 38 


oti the 


692« / « 




said th« 


406 


- i 24 


he 


^76 


. 22 


it was 


369 


.22 


r 

^^nd he 


31? . 




at the 


315 


. 19 


was a ^ 


297 , • • 


. .17 


into the 


285 


.17 


all the 


279 


.16 


he said 


261 


. 15 


in a 


* 258 » 


.15 


out of 


254. 


.15 


a little^ 


240 


. 14 


the little 


234 


.14 


there was 


224 


.13 


for the 


222 


,13 


I an 1 


216 


.^13 


to be 


210 ^ *■ 


* iz 


he had*. v 


206 


.12 , 
.11 


ffott the 


197 


for a 


195 


' .11 


and a 


182 


.11 


with a 


178 


- .10 
.LO/ 


and I 


i70 


on his 


169 • . 


. • .10 • 


of his 


163 1 


^ .l(f 


( his mother 


163 


^ MO 


I have 


159 


.09 


with the 


158 


.09 


it is 


158 


.09 


to see 


158 


• .09* 


I will 


154 


.09 


went to 


"154 


.09 . 


began to 


L53 


" .09 


%#hen be ' 


153 , 


.09 


but the 


149 


' .09 



0.68 

'1,19 
1/-61 
1,.'99 

.2.34 • 

2.58 

2.80 

3.02 ■ 

3.21 

3.40 

3.57 

3.74 

3:90 

4.05 

4.20 

4.3S 

4.49 

4,63 

4.76* 

4.89 

5.02 

/l4 

5.26 

5.37 



5. 

5. 

'5 . 

5, 

5, 

5 

6 

6 

6 

6 

'6 
6 
6 
6 



48 
59 
69- 
79 
•89 
99. 
09 
18 
,27 
36 
.45 
."54 
.63 
.72 
.81 
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TABLE i - Continued 



String 



Freqiieticy 



X of Total 

StrJings 
Encountered 



€unulatiye 
Frequency 





aad she 


149 


. .09 . 
: .09 ^ 


6.99 




she said 




7.08 




a big 


. 148 


' .$9 - • 


. 7.^7 


- 


in bis 


148 


.09 


7.-26 


« 


she was 


145 


.08 


7.34 - 




• to get 


144 


.08 


7.42 • 




had a 


144 


.08 


V- '7.50 




and they 


144 


.05 


7.58 




have a 


141 • 


.08 
^.08 


7.66 


* 


the water 


139 


7.74 




^ou are 


137 


' .08 


7.82 - 




through the 


•137 


• .08" 


,7.90 




.the B\sn* y 
the other' 


137 


.08 


7.98 




137 


.08 • 


8.06 




to go 


135 


.08 


" 8.14 


• 


'like a 


134 


.08 


8.22 




eoing to 


134 


..08" 


8.30 ■ 


i 


they were 


134 


.08 


8.38 




the moon 


134 


.08 


8.4^# 




up and* 


13.2 


.08 


8.54 




. on a 


129 


.08 


8.62 




/ dovn the 


128 


.08 ■ 


8.70 




/ the way 


128 


.08 


8.78 




. he could 


126 


• .-07 


8 .,85 




and then 


1.26 , 


.07 


8-92 




cane to 


,123- ' • 


.07 


8.99 




over the 


'123 


.07 " 


9". 06* 




when the 


121 


.07 


• 9.13 


• 


as he 




.07 


9.20* 




.tHe ttan 


119 


.07 . 


9.27 




then he • 


119 

^117 


■ .-07 . 


9.34 




the tree 


.07 


9.41 




«ne day 


lt7 . 


".07 


9.48 




of a 


116 


.07 


9.55 




'is a 


115' ■ 


.cT? 


9.62 




and his 




U)7 


9.69 




with his 


\u 


,07 , 


9^.76 




up the 


H3 


.07 


9*5 • • 




he went 


113 


.07 


• 9.90 ■ 


• 


the big 


105 


.06 


9.96 , 
^6.02 ^ 




are you 


104 


.06 ' , 




Co his 


103 


.06 


10,08 
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m TABLE 1 - continued 



• 

* 


String 


* » • 

Freqiiency 


I of Total ' 
Strings ^ 
Encountered 


• 

CuBuIative 
Freqaency 

i 


V 

* 


back to 


102 


. .06 


10.20 


• 


but he 


U02 


.06 


10.20 




by the ^ 


' 101 


.06 


' 10.26>^ 


• 


the door 


100 


.06 / 


t 10. 3f ♦ 




did not 


ipo 


. .06 


10.38 . 1 




• ^ the sky ^ 


100 


.06 


10.44 




^do you 


100 


.06 


10.50 




• 




* 

a 





er|c 
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TABLE Z- '■ 
MOSI COIWON THKEE-WOiLD' SlRIJiGS 



String, 



Frequency 



1 of Total 
Strings • 
Encountered 



_ emulative 
Fr^qu^cy 



out of the 
there was a 
went to the 
It vas a 
back to the 
up and down 
go to the . 
the enfa of 
came to the' 
said the man 



113 
1^ 

59 

58 

47 

40 

39- ■ 

. 37 
36 ■ 
'36 



'.08 
.07 
.04 
.04 
.03 

-.03 
.03 
.03 
,02 
.02 



- .08 
-.15 
.19 
.23 

.26 

'- .29 
.32 
.35- 
.37 
.39 



'1^ 
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, , • TABLE 3^ ' . ^ 

FRBQUBUCY of words found in TVro-WORiT STRINGS 



. . Frequency (as found - 
Hotd in, Strings occurring 
100 cises or more) \^ 



Frequency (a:B foxtod ^ 
Wofd in strings occurring 
100 cimee' or aore) 



the 




29 


It ^ 




13 


he 


« 


12 
11 


end 




9 


hie 


s 


'?• 


wee* 




.5 ■ 


of 




4, 


M 




4* 


you • 




3 ^ 
3 ^ 


with 




eeld 




3 


on 




3 


in 




3 


ItF 




2 


little 


2 


for 




2 


had 




. 2 


. have ' 




2 


ie 




2 


went* 




, • 2 


when 


• 


2 ■ 


but 




2 


ehe 




2 


big* 




2 


they 




2 


are 




2 




A. 


2 


then 


0 


2 


at- 




* 1 


*ln^o 




1 


all 




l' 


out . 




1 


there 




1 



a2&* ' 


1 


be 


- i 


from 


1. 


mother*' 


1 


see 


1 


will 


1 


began* ' 


1 




1 


water 


, 1- 


thtfough* ^ " 


1 


iun* 




other 


1 


go* 


1 


like 1 ' , 


■ 1 


going* 


r 


were • 


'■ 1 


moon* 




down 


1 


way 


; .-1 


^ould 


' 1 


1 


over 


i 


as • 


1 


manA 


1 


tree* 


.-1 


one 


. 1 


day* * 


1 


back*. 


.1 


by 


1 


door* ' 


•a 


did 


1 


not * 




■ky* 


. * ^-1 


do 


1 



•Denotes word not within the 100 most frequently occurring words on the 
Carroll, Davies, lichnan (1971) word list • • i. 
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